An Efficient Parallel Sorting Algorithm on Metacube Multiprocessors
نویسندگان
چکیده
Parallel sorting algorithms in hypercubes have been studied extensively. One of the practical parallel sorting algorithms is Bitonic Sort, which is implemented in O(n) time for sorting N = 2 numbers in an n-cube. A versatile family of interconnection networks alternative to hypercube, called metacube, was proposed for building extremely large scale multiprocessor systems with a small number of links per node. A metacube MC(k,m) connects 2 m+k nodes with only k + m links per node. In this paper, we present an efficient sorting algorithm on metacube multiprocessors. The proposed sorting algorithm is based on the Batcher’s bitonic sorting algorithm. In order to perform the parallel sorting efficiently in metacube, we give a new presentation of the metacube such that the communications required by the algorithm can be done efficiently with gather and scatter operations. The parallel bitonic sort algorithm implemented in metacubes with the new presentation runs in O(2m + k) computation steps and O(2m(2k + 1) + k) communication steps.
منابع مشابه
Efficient Communication in Metacube: A New Interconnection Network
This paper introduces a new interconnection network for very large parallel computers called metacube (MC). An MC network has a 2-level cube structure. An MC(k,m) network connects 2m2 k+k nodes with m + k links per node, where k is the dimension of a high-level cube and m is the dimension of low-level cubes (clusters). An MC network is a symmetric network with short diameter, easy and efficient...
متن کاملDesigning Practical Efficient Algorithms for Symmetric Multiprocessors
Symmetric multiprocessors (SMPs) dominate the high-end server market and are currently the primary candidate for constructing large scale multiprocessor systems. Yet, the design of eecient parallel algorithms for this platform currently poses several challenges. In this paper, we present a computational model for designing eecient algorithms for symmetric multiprocessors. We then use this model...
متن کاملParallel Sorting by Regular Sampling
A new parallel sorting algorithm suitable for MIMD multiprocessors is presented. The algorithm reduces memory and bus contention, which many parallel sorting algorithms suffer from, by using a regular sampling of the data to ensure good pivot selection. For n data elements to be sorted and p processors, when n ≥ p 3 the algorithm is shown to be asymptotically optimal. In theory, the algorithm i...
متن کاملBalanced Parallel Sort on Hypercube Multiprocessors
A parallel sorting algorithm for sorting n elements evenly distributed over Z d = p nodes of a d-dimensional hypercube is presented. The average running time of the algorithm is O( ( n log n)/p + p log2 n). The algorithm maintains a perfect load balance in the nodes by determining the (kn/p)th elements (k = 1,. . . , ( p 1)) of the final sorted list in advance. These p 1 keys are used to partit...
متن کاملAn Empirical Analysis of Parallel Random Permutation Algorithms ON SMPs
We compare parallel algorithms for random permutation generation on symmetric multiprocessors (SMPs). Algorithms considered are the sorting-based algorithm, Anderson’s shuffling algorithm, the dart-throwing algorithm, and Sanders’ algorithm. We investigate the impact of synchronization method, memory access pattern, cost of generating random numbers and other parameters on the performance of th...
متن کامل